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ABSTRACT 

A variety of approximate schemes for modelling the low-density Intergalactic 
Medium (IGM) in the high-redshift Universe is compared to the results of a 
large high-resolution hydro-dynamical simulation. These schemes use either an 
analytical description of the dark matter distribution and the IGM or numer- 
ical simulations of the DM distributions combined with different approximate 
relations between dark matter field and the gas distribution. Schemes based 
on a filtering of the dark matter distribution with a global Jeans scale result in 
a rather poor description of the gas distribution. An adaptive filtering which 
takes into account the density/temperature dependence of the Jeans scale is 
required. A reasonable description of the gas distribution can be achieved us- 
ing a fit of the mean relation between the dark matter and gas densities in 
the hydro-dynamical simulation to relate dark matter and gas distribution. In 
the hydro-dynamical simulations deviations from this mean relation are cor- 
related with gradients in the dark matter peculiar velocity field indicative of 
shocks in the gas component. A scheme which takes into account this corre- 
lation results in a further improved gas distribution. Such adaptive filtering 
schemes applied to dark matter simulations will be very well suited for studies 
of statistical properties of the Lya forest which investigate the IGM and the 
underlying dark matter distribution and require a large dynamic range and/or 
an extensive parameter study. 

Key words: Cosmology: theory - intergalactic medium - large-scale structure 
of universe - quasars: absorption lines 



1 INTRODUCTION 

The low-density Intergalactic Medium (IGM) offers a 
unique and powerful probe of the high redshift Universe. 
Numerous weak absorption lines seen in the spectra of 
distant quasars (the Lya forest) are produced by the 
small residue of neutral hydrogen in filamentary struc- 
tures intersected by the line of sight (Bahcall & Salpeter 
1965; Gunn & Peterson 1965; see Rauch 1998 for a re- 
cent review). Such structures arise naturally in hierar- 
chical cold dark matter dominated models of structure 
formation, in which the IGM is highly ionised by the 
UV-background produced by stars and galaxies. Simu- 
lations show that the low-column density (Nhi < 10^'*'^ 
cm~^) absorption lines are produced by small fluctu- 
ations in the warm (T ~ 10* K) photo-heated IGM, 
which smoothly traces the mildly non-linear dark mat- 



ter filaments and sheets on scales larger the Jeans scale 
(Cen et al. 1994, Petitjean et al. 1995, Miralda-Escude 
et al. 1996, Zhang et al. 1998). 

This picture is supported by analytical studies 
based on simple models for the IGM dynamics. Such 
models are based on either a local non-linear mapping of 
the hnear density contrast, obtained for example by ap- 
plying a lognormal transformation (Coles & Jones 1991) 
to the IGM (Bi, Borner & Chu 1992; Bi 1993; Bi, Ge 
& Fang 1995; Bi & Davidsen 1997, hereafter BD97), 
or on suitable modifications of the Zel'dovich approxi- 
mation (Zel'dovich 1970), to account for the smoothing 
caused by the gas pressure on the Jeans scale (Reiseneg- 
ger & Miralda-Escude 1995; Gnedin & Hui 1998; Hui, 
Gnedin & Zhang 1997; Matarrese & Mohayaee 2002). 
Many properties of the Lya absorbers can be under- 
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stood with straightforward physical arguments (Schaye 
2001, Zhang et al. 1998). 

The most convincing support for this picture comes, 
however, from the comparison of observed spectra with 
mock spectra computed from hydrodynamical numeri- 
cal simulations (Cen et al. 1994; Zhang, Anninos & Nor- 
man 1995, 1997; Miralda-Escude et al. 1996; Hernquist 
et al. 1996; Charhon et al. 1997; Theuns et al. 1998). 
These simulated spectra accurately reproduce many ob- 
served properties of the Lya forest. Although there are 
still some discrepancies between observed and simulated 
spectra, especially in the Doppler parameters of the ab- 
sorption lines (Theuns et al. 1998; Bryan et al. 1999; 
Meiksin et al. 2001). 

Observationally, the unprecedented high resolution 
observations of the HIRES and UVES spectrographs on 
the Keck and VLT telescopes, as well as observations 
with the Hubble Space Telescope, have lead to great ad- 
vances in the understanding of the Lya forest. HIRES 
allowed to detect lines with column densities as low as 
Nhi ~ 10^^ cm~'^, while HST made a detailed analysis 
of the low-redshift Lya forest at 2; < 1.6 possible. Recent 
results include limits on the baryon density (Rauch et 
al. 1997), the temperature and equation of state of the 
IGM (Schaye et al. 2000; Theuns et al. 2001; Ricotti, 
Gnedin & Shull 2000; Bryan & Machachek 2000, Mc- 
Donald et al. 2001), the power spectrum of the density 
fluctuations (Croft et al. 1998, 2001, Gnedin & Hamilton 
2001, Zaidarriaga et al. 2001), the geometry of the Uni- 
verse (Hui et al. 1999; McDonald et al. 2001; Viel et al. 
2002) and direct inversions of the density field (Nusser 
& Haehnelt 1999, 2000; Pichon et al. 2001). Kim et al. 
(2001) used high resolution VLT/UVES data, to make 
an extensive analysis of the Lya forest in the redshift 
range 1.5 < z < A. Dobrzycki et al. (2001) presented 
results on the clustering and evolution of the lines at 
z < 1.7 using the HST/FOS spectrograph. 

Many aspects of the warm photo-ionised Intergalac- 
tic medium can be well modelled by hydrodynamical 
simulations. Hydro simulations are, however, stiU rather 
limited in dynamic range. The box size of hydro simula- 
tions which resolve the Jeans mass of the photo-ionised 
IGM with a temperature of ~ lO'^K probe the large 
scale fluctuations of the density fleld rather poorly. This 
leads to uncertainties due to cosmic variance in the fluc- 
tuations on scales approaching the box size and missing 
fluctuations on scales larger than the simulation box. 
Because of limited computational resources it is also 
hardly possible to perform extensive parameter studies. 
In order to overcome these problems approximate meth- 
ods for simulating the Lya forest in QSO absorption 
spectra are often used (e.g. Gnedin & Hui 1998; Croft 
et al. 1998; Croft et al. 1999; Meiksin & White 2001). 
We test here a wide range of such approximate methods 
against a large high-resolution hydro-dynamical simula- 
tion. 

The plan of the paper is as follows. In Section 2 
we briefly describe the hydro-simulation used. Section 3 
presents the lognormal model for the IGM and an im- 
proved model based on the implementation of the gas 
probability distribution function taken from the hydro- 
simulation. We improve our modelling in Section kl us- 



ing two different ways of flltering the linear dark matter 
density field to model pressure effects. In this Section 
we present a further improvement, based on modelling 
the gas distribution starting from the non-linear dark 
matter density field obtained from the hydro-simulation 
and we show how the smoothing on the Jeans length of 
the evolved dark-matter density field poorly reproduces 
the real gas distribution. In Section ^ we compare the 
different methods proposed in terms of the 1-point and 
2-points PDF of the flux. Section |^ contains a general 
discussion and our conclusions. 



2 THE HYDRO-DYNAMICAL 
SIMULATION 

In the following Sections we will test our approxi- 
mate schemes to model the Lya forest against a large 
high-resolution hydro-dynamical simulation. We there- 
fore summarise here some parameters of the hydro- 
dynamical simulation used. The simulation techniques 
are described in more detail in Theuns et al. (1998). 
We analyse a total of 7 outputs at redshifts z = 49, 
z = 10, z — 4, z — 3.5, z = 3, z = 2.25, z = 2 of a peri- 
odic, cubic region in a ACDM Universe. The cosmolog- 
ical parameters are: flom ~ 0.3, Qqa ~ 0.7, -ffo = WOh 
km s"^Mpc with h = 0.65 and Qobh'^ = 0.019. The co- 
moving size of the box is 12.0//i Mpc. There are 256^ 
DM particles and 256"^ gas particles, whose masses are 
niDM = 1.13 X IO^Mq and micM = 1.91 x IO^Mq. The 
input linear power spectrum was computed with CMB- 
FAST (Seljak & Zaidarriaga 1996), and normalised to 
the abundance of galaxy clusters using as = 0.9 (Eke, 
Cole & Frenk 1996), where erg denotes the mass fluctu- 
ations in spheres of radius 8h~^ Mpc. 

The simulation code is based on HYDRA (Couch- 
man et al. 1995) and combines Smoothed Particle Hy- 
drodynamics (see e.g. Monaghan 1992) with P3M for 
self-gravity. Spline interpolation over gas particles al- 
lows the computation of smooth estimates for density, 
temperature and velocity and their gradients. The width 
of the spline kernel is matched to the local particle num- 
ber density, in this way high density regions have higher 
numerical resolution than lower density ones. Photo- 
ionisation and photo-heating rates are computed using 
the fits in Theuns et al. (1998). 

Along many randomly chosen sight lines parallel 
to one of the axis of the simulation box, we compute 
the gas and DM (over) density and peculiar velocity, 
the gas temperature, and the neutral hydrogen (den- 
sity weighted) density, temperature and peculiar veloc- 
ity. This allows us to compute the absorption spectra. 
The smooth estimate for the dark matter fields are com- 
puted using SPH interpolation as well, with a smoothing 
length chosen to given of order ~ 32 neighbour contri- 
butions per particle (Appendix A) . 



MODELS USING ANALYTIC 
DESCRIPTIONS OF THE DARK 
MATTER DISTRIBUTION 
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3.1 The lognormal model 

We have started with the model introduced by Bi and 
coUaborators (Bi et al. 1992, 1995; Bi 1993; BD97) for 
generating a Lya absorption spectrum along a LOS. 
This simple model predicts many properties of the ab- 
sorption lines, including the column density distribution 
and the distribution of line widths (6 parameters), which 
can be directly compared with observations (BD97). Re- 
cently, the BD97 model has been used by Roy Choud- 
hury et al. (2000, 2001) to study neutral hydrogen cor- 
relation functions along and transverse to the line-of- 
sight. Feng & Fang (2000) adopted the BD97 method 
to analyse non-Gaussian effects on the transmitted flux 
stressing their importance for the reconstruction of the 
initial mass density field. Viel et al. (2002) implemented 
a variant of the BD97 model to simulate multiple sys- 
tems of QSOs and found correlations in the transverse 
direction in agreement with observations (see also Petry 
et al. 2002 for an analysis of the correlations in the trans- 
verse direction). 

The BD97 model is based on the assumption that 
the low-column density Lya forest is produced by 
smooth fluctuations in the intergalactic medium which 
arise as a result of gravitational growth of perturbations. 
Since the fluctuations are only mildly non-linear, density 
perturbations in the intergalactic medium 5j'^^(x, z) 
can be related to the underlying DM perturbations by 
a convolution, which models the effects of gas pressure. 
In Fourier space one has: 

Sf^{}^,z) = WiGM{k,z)D+{z)5^^{\^) (1) 

where D+[z) is the growing mode of density perturba- 
tions (normalised so that -D+(0) — 1) and 5^^{\C) is the 
Fourier transformed DM linear over density at z = 0. 
The low-pass fllter WiGm{k,z) = (l + feV^j)"^ depends 
on the comoving Jeans length 



k:j\z) = H-, 



2-ykBTm{z) 
3/impf2om(l + z) 



1/2 



(2) 



where ks is Boltzmann's constant, Tm the gas temper- 
ature, ^ the molecular weight and 7 the ratio of spe- 
cific heats, riom is the present-day matter density. The 
Jeans length depends on density and temperature. Jeans 
smoothing is therefore an adaptive smoothing of the 
density field. Ifowever, for simplicity in most practical 
implementations the comoving Jeans length is assumed 
to be constant and computed for the mean density and 
the mean temperature To at mean density 5 = 0. As we 
will see later this is a severe restriction which signifi- 
cantly affects the results. 

Gnedin & Hui (1998) adopt a different and more ac- 
curate expression for the IGM fllter Wicmik, z), which, 
however, does not allow a simple matching with the 
non-linear regime (see also the discussion in Section 
[4.l[ ). More accurate window-functions have also been 
proposed by Nusser (2000) and Matarrese & Mohayaee 
(2002). In what follows, we take To{z) oc 1 + z, which 
leads to a constant comoving Jeans scale. This assump- 
tion should not be critical as the redshift intervals con- 
sidered here are small. 

Bi & Davidsen adopt a simple lognormal (LN) 



transformation (Coles & Jones 1991) to obtain the IGM 
density in the mildly non-linear regime from the linear 
density held. 



1+5igm(x, z) = exp 



So (x, 2) 



m'''^r)Di(z) 



(3) 



where 1 + 5igm{x,z) = niGM(x, 2)/niGM(2) and 
niGM(2) ~ 1.12 X 10-^nobh'\l + zf cm-^ The IGM 
peculiar velocity v^*^"^ is related to the linear IGM den- 
sity contrast via the continuity equation. As in BD97, 
we assume that the peculiar velocity is still linear even 
on scales where the density contrast gets non-linear; this 
yields 



v'«'"(k, z) = E+iz)'^WjGM{k, 2)5o°^(k) 



(4) 



with E+{z) = H{z)f{n,„,nA_)D+{z)/{l + z). Here 
/(Jlmif^A) = —din D+{z) /dln{l + z) (e.g. Lahav et al. 
1991, for its explicit and general expression) and H{z) 
is the Hubble parameter at redshift z, 



H{z) = Hoy/Qomil + zy^ + ^0^(1 + 2)2 + S^OA 



(5) 



where floA is the vacuum-energy contribution to the cos- 
mic density and Qoiz = 1 — ^Om — ^oa {^oti ~ for a 
flat universe) . The procedure to obtain ID LOS random 
flelds from 3D random fields is described in BD97 and 
Viel et al. (2002). We will explore below a more accurate 
mapping between the linear and non-linear gas density 
which uses the PDF of the gas density obtained from 
hydro-dynamical simulations. 

The neutral hydrogen density can be computed 
from the total density, assuming photo-ionisation 
and taking the optically thin limit, nHi(x, z) = 
/Hi(r, r,y)nH(x,2), where nu = (l-y)niGM and Y is 
the helium abundance by mass. The photo-ionisation 
rate F = x 10~^'^s~^ is related to the spectrum 
of ionising photons as F = J A-Kj{v)/hpVa-{v)dv, 
where aiv) is the photo-ionisation cross-section and 
hpvth the hydrogen ionisation threshold {hp de- 
notes Planck's constant). J{i^) = J2i{i'o/v)"^ x 
10~^^erg s~^Hz~^cm~'^sr~^ with Uth the frequency of 
the HI ionisation threshold, and m is usually assumed 
to lie between 1.5 and 1.8. In the highly ionised case 
{nm ^ niGu) of interest here, one can approximate the 
local density of neutral hydrogen as (e.g. Hui, Gnedin 
& Zhang 1997) 



nHi(x, z) 
niGM(z) 



10~ 



0.019 

r(x,2) 



r-12 
0.5 



l + z 



(l+<5IGM(x,2))^(6) 



The temperature of the low-density IGM is deter- 
mined by the balance between adiabatic cooling and 
photo-heating by the UV background, which establishes 
a local power-law relation between temperature and 
density, T(x,z) = To{z){l + 5''^'^(x, z))^("'-\ where 
both the temperature at mean density To and the adi- 
abatic index 7 depend on the IGM ionisation history 
(Meiksin 1994; Miralda-Escude & Rees 1994; Hui & 
Gnedin 1997; Schaye et al. 2000; Theuns et al. 2001). 
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Given the neutral density, the optical depth in 
redshift-space at velocity u (in km s~^) is 



r(u) = 



dynm{y)V [u - y - v\\^^{y),b{y)\ (7) 



where cro.a = 4.45 x 10~^* cm^ is the hydrogen Lya 
cross-section, y is the real-space coordinate (in km s~^), 

V is the standard Voigt profile normalised in real-space, 
and 6 — (2fc_Br/mc^)^^^ is the thermal width. Velocity 

V and redshift z are related through d\/\ = dv/c, where 
A = \o{l-\- z). For the low column-density systems con- 
sidered here, the Voigt profile is well approximated by 
a Gaussian: V = {^/^b)-^ e^p[-{u -y- vp^(2/))V&^]. 
As stressed by BD97 peculiar velocities aflect the opti- 
cal depth in two different ways: the lines are shifted to 
a slightly different location and their profiles are altered 
by velocity gradients. In our modelling, we treat r_i2 
as a free parameter, which is varied the observed effec- 
tive opacity rcfr(2;) = — ln{exp (— r)) is matched (e.g. 
McDonald et al. 1999; Efstathiou et al. 2000) at the 
median redshift of the considered range (tch — 0.12 and 
Teff = 0.27 at z = 2.15 and z = 3, respectively, in our 
case). We determine r_i2 by requiring that the ensem- 
ble averaged effective optical depth is equal to the ob- 
served effective optical depth. The transmitted fiux is 
then simply = exp(— r). 



3.2 Improving the mapping from the linear 

dark matter density to the non-linear gas 
density 

In this subsection we describe how the mapping from 
the linear DM density to the non-linear gas density can 
be improved. As described in the previous section the 
semi-analytical modelling of Lya forest spectra involves 
two main steps: 

• smoothing of the linear DM density field to obtain 
a linear gas density field 

• a local mapping from the linear to the non-linear 
gas density. 

In this section we will substitute the log-normal 
mapping used by BD97 for the second step by a rank- 
ordered mapping from the linear gas density field to the 
PDF of the gas density in the hydro-simulation. Fig- 
ure 1^ compares the PDF of the DM density field ob- 
tained using SPH interpolation and that of the gas den- 
sity field. Pressure forces push the gas out into the sur- 
rounding voids ('Jeans smoothing') and for this reason 
the PDF of the gas drops below that of the DM density 
at low densities. At high density the gas is converted 
into stars and the PDF of the gas density drops again 
below the PDF of the DM density. As a result the PDF 
of the gas density is more peaked (Theuns, Schaye & 
Haehneh 2000). Figure | compares the resulting proba- 
bility distribution of the flux for simulated spectra using 
the lognormal model our 'improved' model (hereafter 
PDF model) and the numerical simulations. For this 
comparison we have produced simulated spectra for the 
lognormal model and the PDF model with the same 
cosmological parameters as in our hydrodynamical sim- 



ulation. We further imposed a power law temperature- 
density relation with To = 10^ "^ K and 7 = 1.2, which 
fits the simulation well (Figure |^ . The simulated spec- 
tra have larger length (~ 16000 kms^^) than the spec- 
tra extracted from hydro-simulations (~ 1400 kms^^), 
but are obtained with the same spectral resolution, ap- 
proximately of 2kms~^. We take into account that the 
hydro-simulations is missing large scale power by ap- 
plying a similar cut-off in the power spectrum used to 
calculate the semi-analytical spectra. This should allow 
a fair comparison. The peculiar velocity field is again 
assumed to be given by linear theory (Eq. ^). All spec- 
tra are scaled to Te// = 0.27 at 2: = 3 and no noise is 
added. 

The probability distribution of the flux for our PDF 
model agrees significantly better with that of the spectra 
obtained from our hydro-dynamical simulation. There 
are still small but significant differences. While the 
agreement for this 1 point statistic seems reasonable the 
agreement becomes very poor for some 2-point statistics 
of the fiux. The function P{Fi, F2, Av)dFi dF2 is the 
probability that two pixels separated in the spectrum 
by a velocity difference Av, have transmitted fluxes in 
intervals dFi , dF2 around Fi and F2 , respectively. In the 
right panel of Figure |2| we show the mean flux difference. 



AF{Fi,5v) 



P{Fi,F2,5v){Fi- F2)dF2, 



(8) 



for 2 values of Fi, as a function of Av (see Miralda- 
Escude et al. 1997, Theuns et al. 2000, Kim et al. 2001). 
The results for the lognormal model and for the PDF 
model are almost identical, which is not surprising since 
we checked that the two functions that map the linear 
density fleld into the non-linear density held are similar. 

For strong absorbers (0 < -F" < 0.1) the agree- 
ment between the numerical simulation and our models 
which rely on an analytical description of the DM den- 
sity field is very poor. What could be the reason for this 
discrepancy? We have made the following major sim- 
plifications: (i) the peculiar velocity field is assumed to 
be linear; (ii) we do not simulate the intrinsic scatter 
present in the relation Tjgm vs. Sigm', (Hi) we recover 
directly the neutral hydrogen density along the LOS by 
assuming eq. (^), which is valid only under some as- 
sumptions (see Subsection 3.1); (iv) we input only the 
1-point probability distribution function of the IGM, 
without taking into account the higher order moments 
of the distribution; (v) the correlations are assumed to 
be those predicted by linear theory, modified by the 
non-linear mapping, while the correlations of the hydro- 
simulation are different. 

Concerning point (i) it has been shown by Hui et 
al. (1997) that the peculiar velocity field can affect the 
shape of the absorption features, while it has very small 
effect on the column density distribution functions of 
the lines. We have run the calculated spectra varying 
the peculiar velocity field and have not found a strong 
dependence of the PDF of the fiux on the peculiar ve- 
locity quantity. This means that this statistics is mainly 
infiuenced by the underlying density field. We have com- 
pared the PDF of the peculiar velocity field predicted 
by linear theory with the peculiar velocity found in the 
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Figure 1. Density distribution of the gas (IGM, continuous line) and dark matter (DM, dashed line) obtained with SPH 
interpolation over 300 LOS extracted from hydro-simulations at 2: = 3 for as ACDM cosmology. 
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Figure 2. Left panel: one-point function of the flux obtained from the hydro-simulations (dashed line), lognormal model (dotted 
line) and 'improved' model (PDF, continuous line). A total number of ~ 10® pixels is used in the computation. All the spectra 
are computed at 2: = 3 for a ACDM model. Right panel: mean flux difference AF{Fi, 5v), for hydro-simulations (dashed line), 
lognormal model (dotted line) and 'improved' model (PDF, continuous line). Two different flux intervals have been chosen 
which correspond to strong absorbers (0 < Fi < 0.1) and to an intermediate strength lines (0.3 < i<2 < 0.6). 



2 = 3 output of hydro-simulation for the IGM and we 
have found that the differences are not big. The assump- 
tion of a linear velocity field should therefore be a good 
approximation. 

If we try to simulate the intrinsic scatter in the 
relation T ~ S, left panel of Figure ^ for example by 
using as input a higher order polynomial fit instead of 
the power-law relation, the improvement in the PDF 
of the flux is negligible. Provided we set the same To of 
the simulations and a 'reasonable' value of 7, the power- 
law equation of state is a good approximation and the 
scatter can be neglected. 

The same considerations are valid for point (iii). In 
fact, we have found that eq. (ph is in good agreement 
with hydro-simulations (Figure^, right panel), showing 
that the approximations we made (optical thin limit, 



high ionisation state) are not affecting the results sig- 
nificantly. 

We thus believe that point (iv) and (v) are the main 
simplification involved in the model. This is somewhat 
expected as in the definition of the optical depth, eq. (Q) , 
the summation is done over nearby pixels, so it is influ- 
enced by the spatial correlations, which are assumed to 
be those predicted by the linear theory, modified by the 
local non-linear mapping. In the following Sections we 
will thus turn to models that are based on the DM den- 
sity field obtained directly from a TV-body simulation 
(Fig. ^ below). These models reproduce the flux cor- 
relation for strong absorbers much better. This demon- 
strates that correlations in the density field introduced 
by the non-linear evolution are indeed responsible for 
the shape of strong lines. These are not not reproduced 
accurately by our local mapping from the linear to non- 



© 0000 RAS, MNRAS 000, 000-000 



6 M. Viel, S. Matarrese, H. J. Mo, T. Theuns & M. G. Haehnelt 




Figure 3. Left panel: contour plot of log(Tf(3M) vs. log(l + Sigm), at ^ = 3. The dashed line is obtained by setting 7 = 1.2. 
Right panel: contour plot of log{Nf{x) (comoving neutral hydrogen density in cm~^) vs. log(l + 5igm) obtained from the ^ = 3 
output of the hydro-simulation. The dashed line is given by eq. Levels of contours increase by a factor 10. (a total number 
of 3 X 10^ points is used in the computation). 



linear gas density. The marked differences for strong ab- 
sorbers are not unexpected as the z = 3 output of hydro- 
simulation contains very strong absorption systems from 
fully collapsed objects which are not related to the Lya 
forest. The spectra of the LOGN model have been pro- 
duced using the Jeans length at the mean temperature 
and with a fixed value of 7. If we use other values for 
the temperature To and for 7 we can obtain a better 
agreement in other flux intervals. In Figure ^ the range 
of flux values in which the agreement with the flux PDF 
of the hydro-dynamical simulation and in /S.F is good 
corresponds to intermediate absorbers (0.2 < F < 0.4), 
this is in part determined by the requirement of having 
a fixed Teff = 0.27 for the ensemble of simulated spec- 
tra in each model, which scales the neutral hydrogen 
fraction. 



4 MODELS USING NUMERICAL 

SIMULATIONS OF THE DARK MATTER 
DISTRIBUTION 

4.1 Zel'dovich modelling of the gas 
distribution 

In this Section we present models of the gas distribu- 
tion based on a modified filtering of the initial condi- 
tions of the dark matter density field. The first method 
is based on the truncated Zel'dovich approximation 
(TZA). Among the possible approximations, TZA has 
been found to produce the best agreement with N- 
body results (see Coles et al. 1993, Melott et al. 1994, 
Sathyaprakash et al. 1995). Hui et al. (1997, here- 
after HGZ) showed that the TZA, with an appropriate 
smoothing and with a recipe which allows to convert 
density peaks into absorption lines, successfully repro- 
duces the observed column density distribution of the 
Lya forest lines over a wide range of Nhi- Gnedin & Hui 
(1998) presented a more accurate semi-analytical model 



of the Lya forest by combining a particle mesh solver 
modified to compute also an effective potential due to 
gas pressure. They showed that a particle mesh solver, 
with an appropriate filtering of the initial condition, can 
be used to model the low density IGM. All these differ- 
ent methods have the advantage of being much faster 
than a hydro-simulation. 

We are now going to make a comparison, LOS by 
LOS, between the simulated gas distribution and the 
effective gas distribution of the z = 3 output of the 
hydro-simulation. Our simulation is significantly larger 
and with higher resolution than those used in previous 
studies. 

Following Hui et al. (1997), we define a non-linear 
wavenumber kni (see also Melott et al. 1994): 

D\{z) / P(fc)d^k= 1, (9) 

where D+{z) is the linear growth factor for the density 
perturbations and P{k) the linear power-spectrum. We 
filter the linear density field with a Gaussian window 
W{k,ks) = exp(-fc^/2fc^), with ks ~ 1.5fc„;. At this 
point we displace particles from the initial Lagrangian 
coordinates q according to the truncated Zel'dovich ap- 
proximation: 

x(q,z) =q + I>+(z)Vq?i/(q), (10) 

with (f>f{q) the initial filtered velocity potential (see 
e.g. Coles et al. 1995), with ^(k) the actual potential 
used in the initial conditions of the simulation. The fil- 
tering is intended to prevent shell-crossing. To mimic 
baryonic pressure, we smooth the initial density field 
with a Gaussian window exp(— A;^/2fcj), with fcj given 
by eq. (^), as in HGZ. The final filtering scale is ef- 
fectively the smaller of ks and kj. In our case, ks ~ 
5.5Mpc~^ and kj ~ 7Mpc~'^, these wave numbers cor- 
respond to scales of As ~ 1.14 Mpc and A./ ~ 0.9 Mpc, 
respectively. This means that the amount of filtering. 
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before displacing particles, is given by the condition of 
eq. (|). 

We also outline here a further approach based on 
the assumption that the displacement between DM 
and IGM particles is 'Zel'dovich-like'. We refer to this 
method as the Zel'dovich Displacement (ZD method), 
while the method in which we assume a perfect tracing 
between the DM and IGM density field will be referred 
as the DM method. The difference, at redshift z, be- 
tween the Eulerian position of a gas and a DM particle 
which have the same Lagrangian coordinate q, accord- 
ing to the Zel'dovich approximation, is: 

Ax(q,2) = D+{z)[VciipiGAi{ci,z) - Vq<^DAf(q)], (11) 

where ipioM and (fiDM are the IGM and DM veloc- 
ity potentials. In Fourier space we have (Matarrese & 
Mohayaee 2002) i/)/GM(k, z) = WioMi'k, z)(j)DM{'ii) and 
from eq. ( |ll| ) we get: 

Ax(k,2) = D+{z)[WiGM - l]ik0z3M(k). (12) 

The above equation shows that for k <^ kj Wigm 
1, Ax(k,2:) 0, while for k > kj, Wigm 0, 
Ax(k, 2;) —> — ik(/)_DA/(k). In the first limit, Jeans 
smoothing is not effective and the IGM traces the DM, 
in the second limit the effect of gas pressure prevents 
large displacements of baryons from their initial loca- 
tion. According to this method, the displacement be- 
tween DM and IGM particles with the same Lagrangian 
coordinate is given by a filtering of the initial DM den- 
sity field. We calculate this displacement at z = 3 with 
the filter proposed by Gnedin & Hui (1998), i.e. in- 
stead of Wigm we use Wzd ~ exp{—k'^/2k%j^), with 
kzD ^ 2.2kj. This choice is motivated by the fact that 
a Gaussian filter gives an excellent fit to baryon fiuc- 
tuations for a wide range of wave numbers. The filter- 
ing scale kzD is the value predicted by linear theory at 
z = 3, assuming that reionization takes places at z ~ 7 
(Gnedin & Hui 1998). The location of the gas particle 
is then found be adding this displacement to the ac- 
tual position at z = 3 of the DM particle with the same 
Lagrangian coordinate. 

In Figure ^ we show four panels, which are slices of 
thickness ~ 0.1 comoving Mpc at the same position 
along the z-axis, for the IGM and the DM distribution of 
the hydro-simulations (top panels), the TZA field (bot- 
tom left panel) and the IGM field obtained with ZD 
method (bottom right). 

By comparing the top panels, one can see that the 
gas is more diffuse than the dark matter. TZA repro- 
duces the main filaments but we know that the agree- 
ment will be better in the low-density regions. ZD (with 
a filtering at kzo ~ 16Mpc~^) seems promising, at 
least by eye. This method allows some diffusion around 
the dark matter to reproduce the gas distribution. A 
more quantitative comparison is needed to see how good 
our IGM density field is compared to the hydro one. 
We compare LOS by LOS the SPH interpolated IGM 
density fields with the 'true' IGM field of the hydro- 
simulation. This test is pretty severe as we are not going 
to filter the density fields and the comparison is made 
'pixel- by-pixel'. 

The results are shown in Figure Isl where we compare 



the amount of scatter predicted by the TZA approxima- 
tion (left panel) and the ZD method (right panel). As 
expected, the test without any filtering on the final den- 
sity field shows that the TZA tracks the simulation, but 
the scatter is very large. The ZD method agrees better 
with the simulation, but the scatter is still rather signif- 
icant. A more detailed comparison is shown in Figure ^, 
where we plot the mean, and scatter around the mean, 
as a function of density. 

Clearly ZD works significantly better than TZA, 
the average value is better reproduced and also the 
scatter is significantly smaller. The inadequacy of the 
TZA was also shown by Bond & Wadsley (1997). The 
amount of scatter and the average value found with the 
ZD method are consistent with the results of Gnedin & 
Hui (1998; see their Figure 4). 

4.2 Using the mean Sdm — Sigm relation of the 
hydro simulation to predict the gas 
distribution from the DM distribution. 

The results of the previous section have demonstrated 
that 'filtering' techniques on the initial conditions do 
not result in density fields that agree well with hydro- 
simulations, at least for a 'pixel-by-pixel' comparison 
with a high-resolution simulation. 

In this section we present an alternative approach 
that starts from the actual DM density distribution of 
the numerical simulation and "predicts" the gas distri- 
bution using a fit to the mean relation between gas and 
DM density. In this way the displacement of the gas 
with respect to the dark matter is modelled statistically 
in real space. The techniques of the previous Section are 
all based on different filtering schemes which smooth 
the IGM density field over a constant scale set by the 
Jeans length at the mean density. This filtering is done 
in Fourier space and leads to equal smoothing of all dark 
matter structures independent of their density. As dis- 
cussed above this strong simplification is responsible for 
the rather strong discrepancies with the gas distribution 
in our hydro-dynamical simulation. 

We compute IGM and DM over densities, IGM 
peculiar velocity and IGM temperature obtained with 
SPH interpolation (see Appendix |^ for details) . From 
the computed IGM and DM density profiles along dif- 
ferent LOS we argue that a simple relation between 
the DM and the gas IGM density fields does not ex- 
ist (see also Cen et al. 1994, Croft et al. 1998; Zhang et 
al. 1998). For a wide range of moderate over densities, 
i.e. —0.75^ 5^ 5, the PDF of the gas distribution lies 
above that of the DM (Fig. ^): this of course does not 
determine the relation between gas and dark matter, 
but shows that we should expect a significant number 
of regions in which Sigm is larger than 5dm. 

This is what we find in the hydro-simulations in 
which there are regions at high 5 where the gas is more 
concentrated than the dark matter. At low 5 there is 
an almost perfect agreement between the gas and the 
dark matter distribution, while at moderate and large 
over densities the relation between gas and dark mat- 
ter is not unique. The highest density peaks are usually 
related to peaks in the temperature of the gas of and 
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Figure 4. Slices along the z-axis of thickness ~ O.l/i^^ comoving Mpc (coordinates in normalised units). Top panels: IGM 
distribution (left) and DM distribution (right), from the z = 3 output of the ACDM model. Bottom panels: Zel'dovich modelling 
of the IGM distribution from the initial conditions of hydro-simulation (left panel, TZA), Zel'dovich displacement added to DM 
particles to mimic baryonic pressure (right panel, ZD). 



strong gradients in its peculiar velocity, suggesting the 
presence of a shock. However, it can happen that a re- 
gion shows a gas density more peaked than the DM, 
while the velocity field and the temperature show no 
particular behaviour. This means that modelling of the 
gas distribution using only the information along the 
LOS will not be very accurate on a point-to-point basis, 
but we might nevertheless hope to obtain a model for 
the gas distribution which has the correct properties in 
a statistical sense. 

We start by plotting the values of Siqm vs. Sdm 
obtained from the simulation at different redshifts. Fig- 



ure 1^ demonstrates that the scatter between the IGM 
and DM densities increases with decreasing redshift, as 
expected. At z — 10, there is almost perfect agreement 
between the two fields, but when cosmic structures get 
non-linear the physics becomes more complex. In addi- 
tion, reionization occurs in our simulation at 2; ~ 6, so 
before that the simulations do not resolve the then much 
smaller Jeans length. At later redshifts, the effect of the 
baryonic pressure can be seen. For under dense regions, 
the IGM density tends to be higher than the DM den- 
sity, because pressure is pushing gas into the low density 
voids. For larger values of 5dm the scatter increases and 
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Figure 5. Contour plots of the true IGM over density derived from tiie iiydro-simulation and tiie simulated IGM over densities 
obtained with the truncated Zel'dovich approximation (TZA, left panel) and the 'Zel'dovich displaced' method (ZD, right 
panel), at 2 = 3. kj for ZD is ~ 16Mpc~^, ks for TZA is ~ 5.5Mpc~^. In both the panels, the number density of the elements 
increases by an order of magnitude with each contour level (a total number of 3 X 10* points are used in the computation). 
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Figure 6. Average and rms value of the difference between logSIM(l + (5/(3M) and logTRUE(l + <5/(3jv/)i with the TZA (dashed) 
and ZD method (continuous). Thick lines represent the rms values, thin lines the average. 



gas can be either denser or less concentrated than the 
dark matter, depending on star formation which sets-in 
in the simulations at over densities > 80 (see Aguirre, 
Schaye and Theuns 2002 for a more detailed description 
of the star formation recipe adopted). 

We fit the relation between gas and DM density 
with a 3rd order polynomial; the results are shown in 
Figure ^ where the range of 5dm plotted is the one of 
interest for the Lya forest, — li$ 5^ 6. If we set y — 
log(l + 5/GA/) and x — \og{l + 5dm) the fitting function 
is; 

y = 0.02 +0.91 X — 0.15 x 

- 0.08*" °'' 2;^ -1.5 < 2; < 1.0, (13) 

where the errors reported are the la uncertainties of 
each coefficient. The same function has been found to 



be a reasonable good fit at 2: = 4 and 2: = 2 as well, so 
it can be used to simulate the gas distribution in this 
redshift range for a ACDM model. From Figure ^ one 
can see that the fitting function predicts that the gas 
is on average more concentrated than the dark matter 
for a wide range of 5dm, although significant scatter is 
present. 

Is it possible to reduce this scatter using other in- 
formation? 

In Figure ^ (left panel) we plot the peculiar velocity 
gradient of the dark matter density fields vs. the differ- 
ence A between the 'true' value of log(l + 5igai) and 
the 'fit' value obtained with eq. (|l^). Both these quanti- 
ties have been smoothed over a scale of ~ lOOkms"^ to 
reduce noise. A is weakly anti-correlated with the dark 
matter peculiar velocity gradient, with regions with a 
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Figure 7. Scatter plots of log(l + Sigm) vs. log(l + 5dm)^ at z = 10 (top left), z = 3.5 (top right), z = 3 (bottom left) and 
z = 2.25 (bottom right) of the hydro-simulation. 30 LOS of 2^" pixels are reported here. Levels of contours increase by a factor 
of 10. 



1.0 




Figure 8. IGM density contrast vs. DM density contrast, at z = 3 for the ACDM model. The continuous line is the 3rd order 
polynomial fit, the dashed line is obtained by setting Sjqm = Sdm- 



negative gradient lying above the mean fit and iience 
have A > 0. On average, a negative velocity gradient 
indicates that the gas is being compressed and so may 
be undergoing moderate or strong shocks. As a conse- 
quence, the gas is also being heated, and this introduces 



the correlation between A and the temperature of the 
gas (Fig. 1^, right panel). This correlation is stronger, 
because the gas temperature is a more direct indicator 
of a shock. Conversely, negative A, i.e. points below 
the fitting function, are related to colder regions of gas 
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Figure 9. Scatter plots of the dark matter peculiar velocity gradient along the LOS vs. A (left panel) and temperature of the 
IGM vs. A (right panel) . where A = log(l + &igm) — fit, i.e. the difference between the true value and the value obtained with 
the fit, (see eq. h3). Continuous line in the left panel represents the fit we have used in simulating the IGM density field. Weak 
correlations are present. Levels of contours increase by a factor of 10. 



and small positive gradients of the dark matter pecu- 
liar velocity field. As expected, the bulk of the points 
is in quiet regions with dvpe.c/dx ~ and temperatures 
between lO*'^ K and 10*-^ K. 

These results suggest that we can indeed model the 
gas distribution that corresponds to a given dark matter 
density and peculiar velocity distribution statistically. 
We use the fit of eq. (jl^) to predict the gas density 
field from the dark matter density field. The modelling 
can be further improved by taking the peculiar veloc- 
ity of the dark matter into account. For this purpose 
we fit the relation shown in the left panel of Figure |^ 
with a 2nd order polynomial because the correlation is 
weak. Models obtained using the density-density fit and 
models using a two parameter density-density plus the 
density-peculiar velocity fit will be referred to as FIT-1 
or FIT-2, respectively. 

The fitting function of eq. (^^ gives a good approx- 
imation to the gas distribution for a ACDM model in the 
redshift range 2< z< 4. The same technique can be ap- 
plied to other cosmological models and different redshift 
ranges. We will now make a LOS by LOS comparison 
for the the gas density distribution of the two models 
FIT-1 and FIT-2 to that of our hydro-dynamical sim- 
ulation. In Figure ^ we show the scatter plots for the 
gas (over) densities for the two models. Figure [lo| shows 
the scatter in terms of the standard and mean deviation 
for both models. The simple DM model where the gas is 
assumed to trace the dark matter exactly is also shown 
(thick, thin and dotted Unes, respectively). 

Figure |l^ illustrates the improvement of using the 
fitting procedure of model FIT-2. The for these fits, 

where = N'^l E^T ^^T^ (^W" = 25 is the num- 
ber of bins) are 0.24, 0.21 and 0.19 for the FIT-1, FIT-2 
and DM, respectively, showing that the scatter is effec- 
tively reduced with the second method but still in the 
total interval — 1.5 < log(l + 5igm) < 1- 



When the density range is constrained to be in the 
interval —1 < log(l -I- Sigm) < 0.6, relevant for the 
Lymana forest, we obtain of 0.06, 0.03 and 0.10 for 
the FIT-1, FIT-2 and DM model respectively, illustrat- 
ing the improvement of the fitting procedure. Compar- 
ing with Figure ^ we see that the FIT-2 method is 
significantly better than ZD in reproducing the mean 
values for log(l -|- 5/gm)~ 0.5 (however, for larger over 
densities ZD shows a better agreement than FIT-2 or 
FIT-1), while the rms values are basically equivalent. 
If we apply the fitting procedure in the range Sjgm^ 1 
which represents the bulk of the IGM, the mean and rms 
deviation between the fitted and true IGM densities are 
smaller than 10 and 30 per cent, respectively. 



4.3 Jeans smoothing of the evolved DM 
density field 

Here we investigate the effect of smoothing the evolved 
rather than the initial DM density field on a constant 
scale set by the Jeans length at mean density. We use 
the DM distribution of the z = 3 output of the hydro- 
simulation to compute the dark matter density field on 
a cube of 128^ mesh points using SPH interpolation as 
described in the Appendix. The cell size of this mesh 
of ~ 0.14 co-moving Mpc approximately resolves the 
Jeans length Xj ~ IMpc. We have to choose a mesh 
because the filtering is done in 3D using fast Fourier 
transforms. However, we note that also ID smoothing 
along LOS is in rough agreement with the 3D one. To 
model the Jeans smoothing we convolve the DM density 
field with a Gaussian filter W = exp(— fc^/2 fc^), with 
kf — kj ^ 7Mpc^^. By comparing Figure ^ with Fig- 
ure |l^ one can see that the Jeans smoothed dark mat- 
ter density field is very different from IGM distribution 
in the hydro-simulation. Given this large discrepancy 
between hydro-simulations and Jeans smoothed dark- 
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Figure 10. Average (thin lines) and rms (thick Unes) value of the difference between log S1M(1+(5/gm) a^nd log TRUE(1+(5/gm)i 
for the FIT-2 (continuous lines), FlT-1 (dashed lines) and DM (dotted lines) methods. 
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Figure 11. Left panel: true versus simulated (FlT-1 model, Eq. Sjqm at z = 3 for the ACDM simulation plotted vs 
simulated Siqm (FIT-1 model) obtained, with the fit of eci. (^). Right panel: true Sjom versus that obtained using FIT-2 
model, improved with the fitting of the scatter (see Figure M). Contour levels are off-set by 1 dex, and are based on 3 X 10^ 
points. 



matter density field we choose not to analyse into the 
details the density statistics as we did for the models of 
the previous subsection. 

The main reason for the large difference between 
the true IGM density distribution, and the one obtained 
from Jeans smoothing the DM density field, is the sim- 
pUfication of a constant smoothing length. In reality, 
the Jeans length depends on temperature, and there- 
fore should be adaptive. Unfortunately, Fourier space 
filtering techniques smooth all structures in the same 
way, since they are global operations. In low-density 
regions, the amount of smoothing in hydrodynamical 
simulations is small, but the Jeans smoothed density 
field is very different from the original DM field. On the 
other hand, for large over densities of the dark matter, 
Jeans filtering at the mean gas density underestimates 



the amount of smoothing, the region at Sigm < 5dm, 
for large values of 5dm, is more populated in Figure ^ 
than in Figure |^ As we will see in the next section the 
resulting flux statistics are very different as well. 



5 A COMPARISON OF FLUX STATISTICS 
FOR THE IMPROVED METHODS 

In this section we compare the flux distribution obtained 
with the different methods for modelling the gas den- 
sity fleld. Figure |l^ shows the 1-point probability dis- 
tribution function for the flux (left panel) and the mean 
flux difference /^F (Eq.^, right panel), for three mod- 
els; (i) DM (dotted line) is the PDF we get by set- 
ting 5iGM = 5dm', (ii) ZD (dashed lines) is the PDF 



© 0000 RAS, MNRAS 000, 000-000 



Modelling of the gas distribution: the Lya forest 13 




log log (1+5) 

Figure 12. Left panel: scatter plot of the dark matter density field smoothed on the Jeans length at 2 = 3 and the dark matter 
density field; both these fields are evaluated with SPH interpolation in 128^ mesh points. 3 X 10^ points are shown; the 1 line is 
obtained by setting Sigm = ^DM- This plot has to be compared with Figure^, which shows the results from hydro-simulations. 
Right panel: PDF of the dark matter (continuous line) and PDF of the Jeans smoothed dark matter density field (dashed line). 
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Figure 13. PDF of the flux (left panel) and A_F which, is related to the 2-point PDF of the flux (right panel). Comparison 
between different methods: FIT-2 (continuous line), ZD (dashed), DM (dotted). The PDFs extracted from hydro simulations 
are represented by the triangles (HYDRO). This plot has to be compared with Figure (H). 



obtaine d w ith the 'Zel'dovich Displacement' method of 
with a fc/ ~ 16Mpc"^ (m) FIT-2 (continu- 



Section 4.1 



ous line) is the PDF obtained with the fitting technique 
of the previous Section and adding the scatter inferred 
from the dark matter peculiar velocity field. We also 
show the flux PDF of the hydro-dynamical simulation 
(triangles) which is the true PDF extracted from the 
z = 3 output of the ACDM model. 

All these methods try to predict the gas distribu- 
tion from the dark matter distribution of the numerical 
simulation. We thereby assume that vhi ~ vjgm ~ 
vdm which we have checked to be a good approxima- 
tion. The local abundance of neutral hydrogen is com- 
puted from the ionisation equilibrium equation, eq. 
We further need to assume the temperature at mean 
density, To, and the slope of the temperature-density 
relation, 7. We take To = lO^'^K and 7 = 1.2. AU sim- 



ulated spectra have been scaled to the same effective 
optical depth, r^ff ~ 0.27 (which is a good fit to the 
observed effective optical depth at redshift z = 3). 

The PDF of the flux is very similar for the three 
models. We performed a quantitative comparison based 
on the Kolmogorov-Smirnov (KS) test which charac- 
terises the difference between two models from the max- 
imum absolute deviation, dKS, between the two cumu- 
lative ffux distributions (see Meiksin et al. 2001 for fur- 
ther details on KS test applied to the PDF of the flux). 
We calculate (Iks for the lognormal model (LOGN) and 
the 'improved' model (PDF) obtained by implementing 
the 1-point probability distribution function of the IGM 
as well. Fluxes obtained with TZA have not been com- 
puted as the large scatter found (left panel. Figure |) 
suggests that this approximation is the least accurate. 
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Table 1. dj^g values for the different methods proposed. 



Model dj^s 

FlT-2 0.014 

FlT-1 0.018 

ZD 0.042 

DM 0.049 

PDF 0.102 

LOGN 0.147 



The KS test was performed for a total of 3 ■ 10^ pixels 
(Table §). 

The best agreement with the hydro-simulation is 
obtained with the FIT-2 method. The KS indicates a 
slightly better agreement than for the FIT-1 method. 
The ZD method gives better agreement than assum- 
ing that the gas traces the dark matter. The PDF and 
LOGN methods are the least accurate; this means that 
the proposed fitting methods determine a better agree- 
ment in terms of the pdf of the fiux and can be consid- 
ered as an effective improvement compared to the PDF 
and LOGN methods. 

The methods based on the actual DM density dis- 
tribution (FIT-1, FIT-2, DM) also reproduce the 2-point 
flux distribution of the hydro-simulation dramatically 
better than those based on linear theory or the lognor- 
mal model as can be seen by comparing Fig. |l3| (right 
panel) with Fig. ^. Note especially the significant im- 
provement in the shape of strong lines. As discussed 
above this implies that this statistic is mainly infiuenced 
by the correlations in the underlying dark matter den- 
sity fields which are now the same for all the models. 

We choose to make a final plot to quantify the dif- 
ferences between the Jeans smoothed dark matter den- 
sity field and the hydro-simulations in terms of the PDF 
of the fiux. Results are shown in Figure The number 
of pixels per LOS is 128, significantly lower than in the 
comparisons between the other improved methods 
To produce 'Jeans smoothed' spectra we assume again 
that vhi ^ vdm and a power- law equation of state for 
the dark matter over density. This time there are also 
significant differences between the the two models also 
in terms of the PDF of the ffux. The hydro PDF predicts 
more dense regions than the Jeans smoothed one, and 
correspondingly less regions with small densities. This 
is somewhat counterintuitive, since the fitting technique 
smooths less in low density regions compared with Jeans 
smoothing. However, we require both types of spectra 
to have the same Te//. Effectively, this means that the 
two PDFs for S are multiplied by a correction factor, 
which can be seen as a scaling of the neutral hydrogen, 
in such a way that the hydro PDF has a larger number 
of regions at large S than the Jeans smoothed field. The 
(Iks of the two distribution is ~ 0.1 slightly better than 
the LOGN model value. 

The main conclusions of this Section are: (i) the use 
of the numerically simulated dark matter density distri- 
bution for the prediction of the gas density distribution 
results in a significant improvement in the comparison 



with the hydro-simulations; (ii) the modelling of the gas 
distribution using the fit to the mean relation between 
dark matter and gas density in the hydro-simulations re- 
sults in much better agreement than the methods where 
a filtering scheme is applied to the initial or evolved den- 
sity field. 



6 DISCUSSION AND CONCLUSIONS 

Many aspects of the warm photo-ionised Intergalactic 
medium can be well modelled by hydrodynamical sim- 
ulations. These are, however, still rather limited in dy- 
namic range and lack the possibility of extensive param- 
eter studies due to limited computational resources. In 
order to overcome these problems we have tested several 
approximate methods for simulating the Lya forest in 
QSO absorption spectra. The modelling consists of two 
main steps (i) modelling the DM distribution and (ii) 
mapping the DM distribution into a gas distribution. 

Methods which use an analytic description of the 
DM distribution like the lognormal model give a rather 
poor description of the gas distribution compared to 
numerical simulations. In Section ^ we have shown that 
this results in fiux PDFs which are in rough agreement 
with the PDF extracted from hydro- simulations. The 
agreement can be improved with a variant of the log- 
normal model where a mapping between linear and non- 
linear IGM density fields calibrated by hydro-dynamical 
simulations is used. However, in both cases, the 2-point 
PDF of the fiux differs significantly from that obtained 
from numerical simulations. This is because the 2-point 
PDF is strongly affected by the underlying correlations 
in the DM density field, which are not well reproduced 
in models based on an extrapolation of linear theory. To 
take properly account of these correlations on a point- 
to-point basis numerical DM simulations are required. 

We thus tested a variety of schemes to relate the 
DM distribution of numerical simulations to the gas 
distribution. These schemes are supposed to take into 
account that the gas is smoothed on a Jeans scale rel- 
ative to the dark matter. We first investigated two ap- 
proximations based on the Zel'dovich approximation, 
the truncated Zel'dovich approximation (TZA) and a 
scheme which we call Zel'dovich displacement (ZD). The 
latter is based on the assumption that the displace- 
ment between DM and IGM at the same Lagrangian 
coordinate depends on the initial DM density field, fil- 
tered on a suitable scale. The TZA reproduces the gas 
density field very poorly in a LOS by LOS compari- 
son. It is actually worse than if we assume that the 
gas traces the DM faithfully. The ZD method which al- 
lows diffusion on a scale smaller than half of the Jeans 
length to mimic baryonic pressure fares somewhat bet- 
ter. The scatter in plots of predicted vs. simulated densi- 
ties is nevertheless only slightly smaller than in a model 
where gas traces dark matter. We have also tested the 
ZD method with smoothing on a global Jeans length 
and have again found poor agreement with spectra ex- 
tracted from hydro-simulations. It may be that filtering 
techniques based on the Zel'dovich approximation can 
reproduce some statistical properties of the Lya forest 
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Figure 14. PDF of the flux obtained from the Jeans smoothed dark-matter density field (continuos line). Filled triangles 
represent the hydro-simulation PDF. 



like the column density distribution reasonably well, but 

they fail in reproducing the flux distribution in detail. 

To make progress we have thus investigated the re- 
lation of the gas density and the DM density in the 
hydro-dynamical simulation in more detail. The rela- 
tion between Sigm and 6dm can be well fit with a 3rd 
order polynomial. There is considerable scatter around 
the mean relation and we have examined the correla- 
tions between deviations from the fit with other physical 
quantities along the LOS. There is a weak correlation of 
these deviations with the filtered dark matter peculiar 
velocity gradient and a somewhat stronger correlations 
with the gas temperature. This indicates that the devi- 
ations are due to moderate or strong shocks in the gas 
component. 

Combining the DM simulations with the fitted re- 
lation between DM density and gas density gives good 
results for both the one and two-point distribution of 
the flux. If we introduce an additional correlation of the 
gas density with the DM peculiar velocity gradient as 
found in the hydro sirrmlations the agreement is further 
improved (a method which we called FIT-2). These flt- 
ting methods give significantly better results than the 
other methods we have discussed. 

Smoothing of the dark matter density field with 
a constant global Jeans scale calculated for mean den- 
sity and temperature results in a gas distribution very 
different from that found in hydro-simulations. This is 
not too astonishing as it does not take into account 
that the .Jeans length depends on temperature and den- 
sity. Jeans smoothing at the mean temperature over- 
estimates the smoothing in low density regions and 
underestimates it at higher density. This leads to sig- 
nificant differences of the fiux statistics compared to 
hydro-simulations. Smoothing of the evolved dark mat- 
ter density field on a global Jeans scale is therefore not 
a promising technique. The reason for the success of 
our FIT-1 and FIT-2 schemes is their adaptive nature 
which takes into account - at least to some extent - the 
density /temperature dependence of the Jeans scale. 

We conclude that large high-resolution DM simular 



tions combined with a two-parameter fit of the DM den- 
sity gas density relation obtained from hydro-dynamical 
simulations are the best compromise between computa- 
tional expense and accuracy when a large dynamic range 
and/or an extensive parameter study are required. 
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APPENDIX A: SPH INTERPOLATION 
ALONG LOS 

In this Appendix we describe the SPH computation of 
physical quantities along a given line of sight through 
the box. We follow the same procedure described in 
(Theuns et al. 1998). We divide the sight line into 
N ~ 2^" bins of width A in distance x along the sight 
line. For a bin j at position x{j) we compute the density 
and the density weighted temperature and velocity for 
the gas and density and weighted velocity for the dark 
matter from: 

ipv)x{j) = ^«x(,)Wi,- (A2) 
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(A3) 



Dobrzycki A Bcchtold J.. Scott J., Morita M., 2001, 



X{i) is a label indicating the abundance of species X 
of particle i (X = Hi ; X =IGM and X =DM denotes 
neutral hydrogen, total gas and dark matter density, 
respectively) . 

Here, Wij = mW{qij)/hf and m is the SPH par- 
ticle mass which is the same for all SPH particles (but 
different for DM and gao partioloa). For W wo uao the 



MA spline (I\/[onaghan 1992) gi^ren by 



W{q) = -(1. +g^(-1.5 + 0.75g)) ifg< 1 

TV 

= i(0.25(2. - qf) if 1 < (? < 2 

TV 

— elsewhere. 

We have defined: 
|x«-x(j)| 



qij = 



hi 



(A4) 



(A5) 



where x(i) and hi are the position and SPH-smoothing 
length of particle i. Note that h is defined in such a 
way that on average 32 particles are within 2h{i) from 
particle i. In this way, for each pixel along the LOS, we 
compute the contribution of all the particles which in- 
fluence this region with a weight given by eq. (A4); this 
is the 'scatter' interpretation (see, for example, Hern- 
quist & Katz 1989). For the computation of the spectra 
we label bins according to velocity and we adopt the 
procedure described in Section |. 
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